Bayesian Analysis of Binary Data Subject to Misclassiication
نویسندگان
چکیده
This paper considers estimation of success probabilities of categorical binary data subject to mis-classiication errors from the Bayesian point of view. It has been shown by Bross (1954) that sample proportions are in general biased estimates. This bias is a function of the amount of misclassiication and can be substantial. Tenenbein (1970) proposed to eliminate the bias by subjecting a portion of the sample to both true and fallible classiiers, resulting in a 2 x 2 table, from which the misclassi-cation rates can be estimated. The rationale is that fallible classiiers are inexpensive relative to infallible ones. Hence if only a part of the sample is measured by the infallible classiier one can obtain a more eecient estimate, for a given sampling budget, than by measuring the whole sample using the infallible classiier. In many contexts an infallible classiier is unavailable or prohibitively expensive. Bayesian methods then provide a useful approach for dealing with the consequent nonidentiiability problems which arise when we want to carry out inference. In this paper we treat both the single measurement and the repeated measurements (where the former is a special case of the latter) from a Bayesian point of view. The posterior analyses are carried out using both Gauss-Jacobi quadrature and Gibbs sampling. Through examples it is shown that in most cases Gauss-Jacobi quadrature produces very good approximations, both in terms of accuracy and speed of computation. The Gibbs sampler requires more computation to reach the same level of accuracy as the Gauss-Jacobi.
منابع مشابه
The Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data
The goal of this study is to introduce a statistical method regarding the analysis of specific latent data for regression analysis of the discrete data and to build a relation between a probit regression model (related to the discrete response) and normal linear regression model (related to the latent data of continuous response). This method provides precise inferences on binary and multinomia...
متن کاملA Bayesian Approach to Estimate Parameters of a Random Coefficient Transition Binary Logistic Model with Non-monotone Missing Pattern and some Sensitivity Analyses
A transition binary logistic model with random coefficients is proposed to model the unemployment statues of household members in two seasons of spring and summer. Data correspond to the labor force survey performed by Statistical Center of Iran in 2006. This model is introduced to take into account two kinds of correlation in the data one due to the longitudinal nature o...
متن کاملDynamic Frailty and Change Point Models for Recurrent Events Data
Abstract. We present a Bayesian analysis for recurrent events data using a nonhomogeneous mixed Poisson point process with a dynamic subject-specific frailty function and a dynamic baseline intensity func- tion. The dynamic subject-specific frailty employs a dynamic piecewise constant function with a known pre-specified grid and the baseline in- tensity uses an unknown grid for the piecewise ...
متن کاملBayesian Determination of Sample Size in Longitudinal Studies with Binary Responses Using Random Effects Models
Sample size determination is important in all statistical studies including longitudinal studies. This is usually done by considering a target power to reduce the costs of sampling. Choosing the right sample size using efficient methods, ensures that the researcher achieve goal of the study, by spending the least amount of energy, time and money. In this article, using a method based on simulat...
متن کاملBayesian Analysis of Multivariate Probit Models
This paper provides a uni ed simulation-based Bayesian and non-Bayesian analysis of correlated binary data using the multivariate probit model. The posterior distribution is simulated by Markov chain Monte Carlo methods, and maximum likelihood estimates are obtained by a Monte Carlo version of the E-M algorithm. Computation of Bayes factors from the simulation output is also considered. The met...
متن کامل